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4 <110> APPLICANT: Yanagi, Masayuki 

5 Emerson, Suzanne 

6 Bukh, Jens 

7 Purcell, Robert 

9 <12 0> TITLE OF INVENTION: CLONED GENOME OF INFECTIOUS HEPATITIS C 
10 VIRUSES OF GENOTYPE 2a AND USES THEREOF 

13 <130> FILE REFERENCE: NIH255.001NP 

15 <140> CURRENT APPLICATION NUMBER: US 09/980, 559C 

16 <141> CURRENT FILING DATE: 2002-05-14 

18 <150> PRIOR APPLICATION NUMBER: PCT/US00/15446 

19 <151> PRIOR FILING DATE: 2000-06-02 

21 <150> PRIOR APPLICATION NUMBER: US 60/137,693 

22 <151> PRIOR FILING DATE: 1999-06-04 
24 <160> NUMBER OF SEQ ID NOS : 70 

26 <170> SOFTWARE: FastSEQ for Windows Version 4.0 

28 <210> SEQ ID NO: 1 

29 <211> LENGTH: 9711 

30 <212> TYPE: DNA 

31 <213> ORGANISM: Hepatitis C virus 

33 <400> SEQUENCE: 1 

34 acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag gaactactgt 60 

3 5 cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc ctccaggccc 12 0 

36 ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg aattgccggg 180 

37 aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg cgtgcccccg 240 
3 8 caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg cctgataggg 3 00 

3 9 tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa atcctaaacc 360 

40 tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt ttccgggcgg 42 0 

41 cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt tgggtgtgcg 480 

42 cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc ccatccctaa 540 

43 agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc ccctatacgg 600 

44 gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc gtccctcttg 660 

45 gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg ataccctaac 72 0 

46 gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780 

4 7 cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 84 0 
48 gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 
4 9 cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 

50 caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 1020 

51 cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 

52 cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140 

53 gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200 

54 agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 

55 catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320 

56 gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380 
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57 cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 

58 gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 

59 tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agcttatttg acatgggccc 1560 

60 caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 162 0 

61 gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 

62 caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740 

63 gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 

64 ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 

65 agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920 

66 cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 

67 gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040 

68 cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 

69 ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 

70 cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220 

71 ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 22 80 

72 atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 

73 tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400 

74 cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 

75 tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 

76 cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 

77 ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 

78 tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 

79 ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 

80 gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820 

81 ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 

82 gtggtggttg tgctatcttc tgaccctggg ggaagctatg gtccaggagt gggcaccacc 2 940 

83 tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3 000 

84 tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 

85 aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 3120 

86 catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 

87 gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 

88 cctgcgggac ctggcggtcg ccgttgagcc tatcatcttc agtccgatgg agaagaaagt 33 00 

89 cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 3360 

90 cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 

91 gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 3480 

92 agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 3540 

93 cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 

94 tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 

95 tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3 72 0 

96 cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3 780 

97 aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3 840 

98 gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3900 

99 tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3960 

100 cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 

101 tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 

102 atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 

103 gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 4200 

104 gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc tcgccgatgg 4260 

105 gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4320 
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106 taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 4380 

107 aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 4440 

108 ggaggtggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 

109 ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 

110 cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4620 

111 ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 

112 gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 4740 

113 cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4800 

114 acgtagccag cgccggggtc gcacgggtag gggaagactg ggcatttata ggtatgtttc 4860 

115 cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4920 

116 aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4980 

117 caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 

118 cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 

119 cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 

120 ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 

121 tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 5280 

122 atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 

123 agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 5400 

124 cggccgcttg cacattaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 5460 

125 ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 

126 gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 
12 7 tcaagacata caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 
128 acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 
12 9 gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 

130 aagcaccact atccttctca acattttggg gggctggcta gcatcccaaa ttgcaccacc 5820 

131 cgcgggggcc actggcttcg ttgtcagtgg cctagtggga gctgccgtag gcagtatagg 5880 

132 cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 

133 cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 

134 gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 

135 ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 

136 ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 

137 tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 6240 

138 gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 

139 ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 
14 0 gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 642 0 

141 catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 6480 

142 catgagaatc acggggccta agacctgcat gaatatctgg caggggacct ttcctatcaa 6540 

143 ttgttacacg gagggccagt gcgtgccgaa acccgcgcca aactttaagg tcgccatctg 6600 

144 gagggtggcg gcctcagagt acgcggaggt gacgcagcac gggtcatacc actacataac 6660 

145 aggactcacc actgataact tgaaagtccc ctgccaacta ccctctcccg agttcttttc 6720 

146 ctgggtggac ggagtgcaga tccataggtt tgcccccaca ccgaagccgt ttttccggga 6780 

147 tgaggtctcg ttctgcgttg ggcttaattc atttgtcgtc gggtcccagc ttccttgcga 6840 

148 ccctgaaccc gacacagacg tattgatgtc catgctaaca gatccatctc atatcacggc 6900 

149 ggagactgca gcgcggcgtt tagcgcgggg gtcaccccca tccgaggcaa gctcctcggc 6960 

150 gagccagcta tcggcaccat cgctgcgagc cacctgcacc acccacggca aagcctatga 7020 

151 tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 7080 

152 gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 7140 

153 gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 7200 

154 ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 7260 
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155 accggccact gttgcgggct gtgctctccc tcctcctagg 

156 aaggaggcgc cggacagtgg gcctaagtga ggactccata 

157 ggccattaag tcctttggcc agcccccccc aagcggcgat 

158 gggcgctgcc gattccggca gtcagacgcc tcctgatgag 

159 ttccatctct tccatgcccc ccctcgaggg ggagcttgga 

160 gcaggtagag ccccaacccc ccccccaggg gggggtggca 

161 gtcctggtct acttgctccg aggaggacga ctccgtcgtg 

162 ctggaccggg gctctaataa ctccttgtag tcccgaagag 

163 cttgagcaac tccctgttgc gatatcacaa caaggtgtac 

164 ctcactaagg gctaaaaagg taacttttga taggatgcaa 

165 ctcagtctta aaggacatta agctagcggc ctccaaggtc 

166 ggaggaggct tgccagttaa ccccacccca ttctgcaaga 

167 taaggaggtc cgcagcttgt ccgggagggc cgttaaccac 

168 cctcctggag gactcagaaa caccaattcc cacaaccatt 
16 9 ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc 

170 cggcgtcagg gtctgcgaga agatggccct ttatgacatt 

171 ggtgatgggg gcttcttatg gattccagta ttcccccgct 

172 gaaagcatgg gcggaaaaga aggaccctat gggtttttcg 

173 ctcaaccgtc actgagagag acatcaggac tgaggagtcc 

174 gcccgaggag gcccacactg ccatacactc gctaactgag 

175 tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt 

176 caccactagc atggggaaca ccatcacatg ctacgtgaaa 

177 tgcagggata atcgcgccca caatgctggt atgcggcgat 

178 aagccagggg accgaggagg acgagcggaa cctgagagcc 

179 gtattctgcc cctcctggtg acccccccag accggagtat 

180 ttgctcctca aatgtgtctg tggcgctggg cccacaaggc 

181 cagagaccct accactccaa tcgcccgggc tgcctgggaa 

182 caattcatgg ctgggaaaca tcatccagta cgccccgacc 

183 gatgacacac ttcttctcca ttctcatggc tcaagacacg 

184 tgagatgtac ggagcggtgt actccgtgag tcccttggac 

185 gttacatggg cttgacgctt tttctctgca cacatacact 

186 ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga 

187 tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg 

188 cttcaattgg gcggtgaaga ccaagctcaa actcactcca 

189 ggatttatcc agctggttca ccgtcggcgc cggcgggggc 

190 gcgtgcccga ccccgcttat tgctctttgg cctactccta 

191 tttcctactc cccgctcggt agagcggcac acattagcta 

192 cttttttttt tttttttttt tttttttttt tttttttttt 

193 tttttttttt tttttttttt tttttctttt tttctctttt 

194 tactttcttt cctggtggct ccatcttagc cctagtcacg 

195 gagccgcatg actgcagaga gtgccgtaac tggtctctct 

197 <210> SEQ ID NO: 2 

198 <211> LENGTH: 3033 

199 <212> TYPE: PRT 

200 <213> ORGANISM: Hepatitis C virus 
2 02 <400> SEQUENCE: 2 

2 03 Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr 

204 1 5 10 

2 05 Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly 



aaaaccccga cgcctccccc 7320 
ggagatgccc ttcaacagct 7380 
tcaggccttt ccacgggggc 7440 
ttggcccttt cggagacagg 7500 
gatccagacc tggagcctga 7560 
gctcccggct cggactcggg 7620 
tgctgctcca tgtcatactc 7680 
gagaagttac cgattaaccc 7740 
tgtaccacaa caaagagcgc 7800 
gtgctcgact cctactacga 7860 
accgcaaggc tcctcaccat 7920 
tctaaatatg ggtttggggc 7980 
atcaagtccg tgtggaagga 8040 
atggccaaaa atgaggtgtt 8100 
cttatcgttt accctgacct 8160 
acacaaaaac ttcctcaggc 8220 
cagcgggtag agtttctctt 8280 
tatgataccc gatgctttga 8340 
atatatcggg cctgctcctt 8400 
agactttacg tgggagggcc 8460 
tgccgcgcca gcggggtgct 8520 
gccttagcgg cttgtaaagc 8580 
gacttggttg tcatctcaga 8640 
ttcacggagg ctatgaccag 8700 
gatctggagc tgataacatc 8760 
cgccgcagat actacctgac 8820 
acagttagac actcccctgt 8880 
atatgggctc gcatggtcct 8940 
ctggaccaga acctcaactt 9000 
ctcccagcta taattgaaag 9060 
ccccacgaac tgacacgggt 912 0 
gcgtggaaga gccgggcacg 9180 
gccgtttgcg gtcgatatct 9240 
ttgccggaag cgcgcctcct 9300 
gacatttatc acagcgtgtc 9360 
ctttttgtag gggtaggcct 9420 
cactccatag ctaactgtcc 9480 
tttttttttt tttttttttt 9540 
ccttctttct taccttattt 9600 
gctagctgtg aaaggtccgt 9660 
gcagatcatg t 9711 



Lys Arg Asn Thr Asn 
15 

Gly Gin lie Val Gly 
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Ser 


Thr 


Gly 


Tyr 


Met 


Val 


Thr 


Asn 


Asp 


Cys 


Thr 


228 






195 










200 










205 








229 


Asn 


Asp 


Ser 


He 


Thr 


Trp 


Gin 


Leu 


Gin 


Ala 


Ala 


Val 


Leu 


His 


Val 


Pro 


230 




210 










215 










220 










231 


Gly 


Cys 


Val 


Pro 


Cys 


Glu 


Lys 


Val 


Gly 


Asn 


Ala 


Ser 


Gin 


Cys 


Trp 


He 


232 


225 










230 










235 










240 


233 


Pro 


Val 


Ser 


Pro 


Asn 


Val 


Ala 


Val 


Gin 


Arg 


Pro 


Gly 


Ala 


Leu 


Thr 


Gin 


234 










245 










250 










255 




235 


Gly 


Leu 


Arg 


Thr 


His 


He 


Asp 


Met 


Val 


Val 


Met 


Ser 


Ala 


Thr 


Leu 


Cys 


236 








260 










265 










270 






237 


Ser 


Ala 


Leu 


Tyr 


Val 


Gly 


Asp 


Leu 


Cys 


Gly 


Gly 


Val 


Met 


Leu 


Ala 


Ala 


238 






275 










280 










285 








239 


Gin 


Met 


Phe 


He 


Val 


Ser 


Pro 


Gin 


His 


His 


Trp 


Phe 


Val 


Gin 


Asp 


Cys 


240 




290 










295 










300 










241 


Asn 


Cys 


Ser 


He 


Tyr 


Pro 


Gly 


Thr 


He 


Thr 


Gly 


His 


Arg 


Met 


Ala 


Trp 


242 


305 










310 










315 










320 


243 


Asp 


Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Ala 


Thr 


Met 


He 


Leu 


Ala 


Tyr 


244 










325 










330 










335 




245 


Ala 


Met 


Arg 


Val 


Pro 


Glu 


Val 


He 


He 


Asp 


He 


He 


Ser 


Gly 


Ala 


His 


246 








340 










345 










350 






247 


Trp 


Gly 


Val 


Met 


Phe 


Gly 


Leu 


Ala 


Tyr 


Phe 


Ser 


Met 


Gin 


Gly 


Ala 


Trp 


248 






355 










360 










365 








249 


Ala 


Lys 


Val 


Val 


Val 


He 


Leu 


Leu 


Leu 


Ala 


Ala 


Gly 


Val 


Asp 


Ala 


Arg 


250 




370 










375 










380 










251 


Thr 


His 


Thr 


Val 


Gly 


Gly 


Ser 


Ala 


Ala 


Gin 


Thr 


Thr 


Gly 


Arg 


Leu 


Thr 


252 


385 










390 










395 










400 


253 


Ser 


Leu 


Phe 


Asp 


Met 


Gly 


Pro 


Arg 


Gin 


Lys 


He 


Gin 


Leu 


Val 


Asn 


Thr 


254 










405 










410 










415 
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4 <110> APPLICANT: Yanagi, Masayuki 

5 Emerson, Suzanne 

6 Bukh , Jens 

7 Purcell, Robert 

9 <120> TITLE OF INVENTION: CLONED GENOME OF INFECTIOUS HEPATITIS C 
10 VIRUSES OF GENOTYPE 2a AND USES THEREOF 

13 <130> FILE REFERENCE: NIH255.001NP 

15 <140> CURRENT APPLICATION NUMBER: US 09/980, 559C 

16 <141> CURRENT FILING DATE: 2002-05-14 



18 <150> PRIOR APPLICATION NUMBER: PCT/US00/15446 ' "^—'Q 



19 <151> PRIOR FILING DATE: 2000-06-02 

21 <150> PRIOR APPLICATION NUMBER: US 60/137,693 

22 <151> PRIOR FILING DATE: 1999-06-04 
24 <160> NUMBER OF SEQ ID NOS : 70 
26 <170> SOFTWARE 



FastSEQ for Windows Version 4.0 (TV** ^OlJlnli/ 



ERRORED SEQUENCES 



28 <210> SEQ ID NO: 1 

29 <211> LENGTH: 9711 

30 <212> TYPE: DNA /Q J 

31 <213> ORGANISM: Hepatitis C virus (i'^A* 

33 <400> SEQUENCE: 1 1 

34 acccgcccct aataggggcg acactccgcc atgaatcact cccctgtgag 

35 cttcacgcag aaagcgtcta gccatggcgt tagtatgagt gtcgtacagc 

36 ccccctcccg ggagagccat agtggtctgc ggaaccggtg agtacaccgg 

37 aagactgggt cctttcttgg ataaacccac tctatgcccg gccatttggg 
3 8 caagactgct agccgagtag cgttgggttg cgaaaggcct tgtggtactg 

3 9 tgcttgcgag tgccccggga ggtctcgtag accgtgcacc atgagcacaa 

4 0 tcaaagaaaa accaaaagaa acaccaaccg tcgcccacaa gacgttaagt 

41 cggccagatc gttggcggag tatacttgtt gccgcgcagg ggccccaggt 

42 cgcgacaagg aagacttcgg agcggtccca gccacgtgga aggcgccagc 

43 agatcggcgc tccactggca aatcctgggg aaaaccagga tacccctggc 

44 gaatgaggga ctcggctggg caggatggct cctgtccccc cgaggttccc 

45 gggccccaat gacccccggc ataggtcgcg caacgtgggt aaggtcatcg 

46 gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc 

47 cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt 

48 gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct 
4 9 cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga 

50 caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc 

51 cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac 

52 cqtqcaqcqq cccqqcqccc tcacqcaqqq cttqcqqacq cacatcqaca 
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gaactactgt 


60 


ctccaggccc 


120 


aattgccggg 


180 


cgtgcccccg 


240 


cctgataggg 


300 


atcctaaacc 


360 


ttccgggcgg 


420 


tgggtgtgcg 


480 


ccatccctaa 


540 


ccctatacgg 


600 


gtccctcttg 


660 


ataccctaac 


720 


tcggcggcgt 


780 


ttgcaacagg 


840 


gcatcaccac 


900 


ctaacgactg 


960 


ccgggtgcgt 


1020 


cgaatgtggc 


1080 


tggttgtgat 


1140 



2 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/980 , 559C 



DATE: 03/24/2006 
TIME: 18:31:17 



Input Set : A:\PTO.txt 

Output Set: N:\CRF4\03242006\l980559C.raw 

151 tgtggacatg gtggatgcta acctgttcat ggggggcgat gtgactcgga tagagtctgg 

152 gtccaaagtg gtcgttctgg actctctcga cccaatggtc gaagaaagga gcgaccttga 

153 gccttcgata ccatcagaat acatgctccc caagaagagg ttcccaccag ctttaccggc 

154 ctgggcacgg cctgattaca acccaccgct tgtggaatcg tggaaaaggc cagattacca 

155 accggccact gttgcgggct gtgctctccc tcctcctagg aaaaccccga cgcctccccc 

156 aaggaggcgc cggacagtgg gcctaagtga ggactccata ggagatgccc ttcaacagct 

157 ggccattaag tcctttggcc agcccccccc aagcggcgat tcaggccttt ccacgggggc 

158 gggcgctgcc gattccggca gtcagacgcc tcctgatgag ttggcccttt cggagacagg 

159 ttccatctct tccatgcccc ccctcgaggg ggagcttgga gatccagacc tggagcctga 

160 gcaggtagag ccccaacccc ccccccaggg gggggtggca gctcccggct cggactcggg 

161 gtcctggtct acttgctccg aggaggacga ctccgtcgtg tgctgctcca tgtcatactc 

162 ctggaccggg gctctaataa ctccttgtag tcccgaagag gagaagttac cgattaaccc 

163 cttgagcaac tccctgttgc gatatcacaa caaggtgtac tgtaccacaa caaagagcgc 

164 ctcactaagg gctaaaaagg taacttttga taggatgcaa gtgctcgact cctactacga 

165 ctcagtctta aaggacatta agctagcggc ctccaaggtc accgcaaggc tcctcaccat 

166 ggaggaggct tgccagttaa ccccacccca ttctgcaaga tctaaatatg ggtttggggc 

167 taaggaggtc cgcagcttgt ccgggagggc cgttaaccac atcaagtccg tgtggaagga 

168 cctcctggag gactcagaaa caccaattcc cacaaccatt atggccaaaa atgaggtgtt 

169 ctgcgtggac cccaccaagg ggggcaagaa agcagctcgc cttatcgttt accctgacct 

170 cggcgtcagg gtctgcgaga agatggccct ttatgacatt acacaaaaac ttcctcaggc 

171 99t9 at 9999 gcttcttatg gattccagta ttcccccgct cagcgggtag agtttctctt 

172 gaaagcatgg gcggaaaaga aggaccctat gggtttttcg tatgataccc gatgctttga 

173 ctcaaccgtc actgagagag acatcaggac tgaggagtcc atatatcggg cctgctcctt 

174 gcccgaggag gcccacactg ccatacactc gctaactgag agactttacg tgggagggcc 

175 tatgttcaac agcaagggcc aaacctgcgg gtacaggcgt tgccgcgcca gcggggtgct 

176 caccactagc atggggaaca ccatcacatg ctacgtgaaa gccttagcgg cttgtaaagc 

177 tgcagggata atcgcgccca caatgctggt atgcggcgat gacttggttg tcatctcaga 

178 aagccagggg accgaggagg acgagcggaa cctgagagcc ttcacggagg ctatgaccag 

179 gtattctgcc cctcctggtg acccccccag accggagtat gatctggagc tgataacatc 

180 ttgctcctca aatgtgtctg tggcgctggg cccacaaggc cgccgcagat actacctgac 

181 cagagaccct accactccaa tcgcccgggc tgcctgggaa acagttagac actcccctgt 

182 caattcatgg ctgggaaaca tcatccagta cgccccgacc atatgggctc gcatggtcct 

183 gatgacacac ttcttctcca ttctcatggc tcaagacacg ctggaccaga acctcaactt 

184 tgagatgtac ggagcggtgt actccgtgag tcccttggac ctcccagcta taattgaaag 

185 gttacatggg cttgacgctt tttctctgca cacatacact ccccacgaac tgacacgggt 

186 ggcttcagcc ctcagaaaac ttggggcgcc acccctcaga gcgtggaaga gccgggcacg 

187 tgcagtcagg gcgtccctca tctcccgtgg ggggagagcg gccgtttgcg gtcgatatct 

188 cttcaattgg gcggtgaaga ccaagctcaa actcactcca ttgccggaag cgcgcctcct 

189 ggatttatcc agctggttca ccgtcggcgc cggcgggggc gacatttatc acagcgtgtc 

190 gcgtgcccga ccccgcttat tgctctttgg cctactccta ctttttgtag gggtaggcct 

191 tttcctactc cccgctcggt agagcggcac acattagcta cactccatag ctaactgtcc 

192 cttttttttt tttttttttt tttttttttt tttttttttt tttttttttt tttttttttt 

193 tttttttttt tttttttttt tttttctttt tttctctttt ccttctttct taccttattt 

194 tactttcttt cctggtggct ccatcttagc cctagtcacg gctagctgtg aaaggtccgt 

195 gagccgcatg actgcagaga gtgccgtaac tggtctctct gcagatcatg t 

196 <211> LENGTH: 3033 

197 <212> TYPE: PRT 

198 <213> ORGANISM: Hepatitis C virus 



7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
954 
96„ 
9660 ^ 
9711<210> 2 
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742 agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 

743 gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 

744 gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 

745 taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 942 0 



746 catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 



747 ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 



748 gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 



750 <211> LENGTH: 3015 

751 <212> TYPE: PRT 

752 <213> ORGANISM: Hepatitis C virus 



--> 754 


<400> SEQUENCE: 


4 
























755 


Met 


Ser 


Thr 


Asn 


Pro 


Lys 


Pro 


Gin 


Arg 


Lys 


Thr 


Lys 


Arg 


Asn 


Thr 


Asn 


756 


1 








5 










10 










15 




757 


Arg 


Arg 


Pro 


Gin 


Asp 


Val 


Lys 


Phe 


Pro 


Gly 


Gly 


Gly 


Gin 


He 


Val 


Gly 


758 








20 










25 










30 






759 


Gly 


Val 


Tyr 


Leu 


Leu 


Pro 


Arg 


Arg 


Gly 


Pro 


Arg 


Leu 


Gly 


Val 


Arg 


Ala 


760 






35 










40 










45 








761 


Thr 


Arg 


Lys 


Thr 


Ser 


Glu 


Arg 


Ser 


Gin 


Pro 


Arg 


Gly 


Arg 


Arg 


Gin 


Pro 


762 




50 










55 










60 










763 


He 


Pro 


Lys 


Asp 


Arg 


Arg 


Ser 


Thr 


Gly 


Lys 


Ser 


Trp 


Gly 


Lys 


Pro 


Gly 


764 


65 










70 










75 










80 


765 


Tyr 


Pro 


Trp 


Pro 


Leu 


Tyr 


Gly 


Asn 


Glu 


Gly 


Leu 


Gly 


Trp 


Ala 


Gly 


Trp 


766 










85 










90 










95 




767 


Leu 


Leu 


Ser 


Pro 


Arg 


Gly 


Ser 


Arg 


Pro 


Ser 


Trp 


Gly 


Pro 


Asn 


Asp 


Pro 


768 








100 










105 










110 






769 


Arg 


His 


Arg 


Ser 


Arg 


Asn 


Val 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


770 






115 










120 










125 








771 


Gly 


Phe 


Ala 


Asp 


Leu 


Met 


Gly 


Tyr 


He 


Pro 


Val 


Val 


Gly 


Ala 


Pro 


Leu 


772 




130 










135 










140 










773 


Gly 


Gly 


Val 


Ala 


Arg 


Ala 


Leu 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


Asp 


774 


145 










150 










155 










160 


775 


Gly 


Val 


Asn 


Phe 


Ala 


Thr 


Gly 


Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser 


He 


776 










165 










170 










175 




777 


Phe 


Leu 


Leu 


Ala 


Leu 


Leu 


Ser 


Cys 


He 


Thr 


Thr 


Pro 


Val 


Ser 


Ala 


Ala 


778 








180 










185 










190 






779 


Glu 


Val 


Lys 


Asn 


He 


Ser 


Thr 


Gly 


Tyr 


Met 


Val 


Thr 


Asn 


Asp 


Cys 


Thr 


780 






195 










200 










205 








781 


Asn 


Asp 


Ser 


He 


Thr 


Trp 


Gin 


Leu 


Gin 


Ala 


Ala 


Val 


Leu 


His 


Val 


Pro 


782 




210 










215 










220 










783 


Gly 


Cys 


Val 


Pro 


Cys 


Glu 


Lys 


Val 


Gly 


Asn 


Ala 


Ser 


Gin 


Cys 


Trp 


He 


784 


225 










230 










235 










240 


785 


Pro 


Val 


Ser 


Pro 


Asn 


Val 


Ala 


Val 


Gin 


Arg 


Pro 


Gly 


Ala 


Leu 


Thr 


Gin 


786 










245 










250 










255 




787 


Gly 


Leu 


Arg 


Thr 


His 


He 


Asp 


Met 


Val 


Val 


Met 


Ser 


Ala 


Thr 


Leu 


Cys 


788 








260 










265 










270 






789 


Ser 


Ala 


Leu 


Tyr 


Val 


Gly 


Asp 


Leu 


Cys 


Gly 


Gly 


Val 


Met 


Leu 


Ala 


Ala 


790 






275 










280 










285 










E--> 749 gcagatcatg t 



9611<210 
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12 84 tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 

1285 tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 

1286 taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 

1287 accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 

1288 ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 

1289 tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 

1290 ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 

1291 ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 

1292 atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 

1293 gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 

1294 agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 

1295 gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 9300 

1296 gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 

1297 taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 

1298 catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 94t 

1299 ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 954C 

1300 gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 96( 
E--> 1301 gcagatcatg t 9611<210> 

1302 <211> LENGTH: 3015 

1303 <212> TYPE: PRT 

1304 <213> ORGANISM: Hepatitis C virus 



180 / 
>40 / 

500 €s 



E--> 1306 


<400> SEQUENCE: 


6 
























1307 


Met 


Ser 


Thr 


Asn 


Pro 


Lys 


Pro 


Gin 


Arg 


Lys 


Thr 


Lys 


Arg 


Asn 


Thr 


Asn 


1308 


1 








5 










10 










15 




1309 


Arg 


Arg 


Pro 


Gin 


Asp 


Val 


Lys 


Phe 


Pro 


Gly 


Gly 


Gly 


Gin 


He 


Val 


Gly 


1310 








20 










25 










30 






1311 


Gly 


Val 


Tyr 


Leu 


Leu 


Pro 


Arg 


Arg 


Gly 


Pro 


Arg 


Leu 


Gly 


Val 


Arg 


Ala 


1312 






35 










40 










45 








1313 


Thr 


Arg 


Lys 


Thr 


Ser 


Glu 


Arg 


Ser 


Gin 


Pro 


Arg 


Gly 


Arg 


Arg 


Gin 


Pro 


1314 




50 










55 










60 










1315 


He 


Pro 


Lys 


Asp 


Arg 


Arg 


Ser 


Thr 


Gly 


Lys 


Ser 


Trp 


Gly 


Lys 


Pro 


Gly 


1316 


65 










70 










75 










80 


1317 


Tyr 


Pro 


Trp 


Pro 


Leu 


Tyr 


Gly 


Asn 


Glu 


Gly 


Leu 


Gly 


Trp 


Ala 


Gly 


Trp 


1318 










85 










90 










95 




1319 


Leu 


Leu 


Ser 


Pro 


Arg 


Gly 


Ser 


Arg 


Pro 


Ser 


Trp 


Gly 


Pro 


Asn 


Asp 


Pro 


1320 








100 










105 










110 






1321 


Arg 


His 


Arg 


Ser 


Arg 


Asn 


Val 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


1322 






115 










120 










125 








1323 


Gly 


Phe 


Ala 


Asp 


Leu 


Met 


Gly 


Tyr 


He 


Pro 


Val 


Val 


Gly 


Ala 


Pro 


Leu 


1324 




130 










135 










140 










1325 


Gly 


Gly 


Val 


Ala 


Arg 


Ala 


Leu 


Ala 


His 


Gly 


Val 


Arg 


Val 


Leu 


Glu 


Asp 


1326 


145 










150 










155 










160 


1327 


Gly 


Val 


Asn 


Phe 


Ala 


Thr 


Gly 


Asn 


Leu 


Pro 


Gly 


Cys 


Ser 


Phe 


Ser 


He 


1328 










165 










170 










175 




1329 


Phe 


Leu 


Leu 


Ala 


Leu 


Leu 


Ser 


Cys 


He 


Thr 


Thr 


Pro 


Val 


Ser 


Ala 


Ala 


1330 








180 










185 










190 






1331 


Glu 


Val 


Lys 


Asn 


He 


Ser 


Thr 


Gly 


Tyr 


Met 


Val 


Thr 


Asn 


Asp 


Cys 


Thr 


1332 






195 










200 










205 
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1826 ggaaagacct tctggaagac agtgtaacac caatagacac taccatcatg gccaagaacg 8040 

1827 aggttttctg cgttcagcct gagaaggggg gtcgtaagcc agctcgtctc atcgtgttcc 8100 

1828 ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta cgacgtggtt agcaagctcc 8160 

182 9 ccctggccgt gatgggaagc tcctacggat tccaatactc accaggacag cgggttgaat 822 0 

183 0 tcctcgtgca agcgtggaag tccaagaaga ccccgatggg gttctcgtat gatacccgct 8280 

1831 gttttgactc cacagtcact gagagcgaca tccgtacgga ggaggcaatt taccaatgtt 8340 

1832 gtgacctgga cccccaagcc cgcgtggcca tcaagtccct cactgagagg ctttatgttg 8400 

1833 ggggccctct taccaattca aggggggaaa actgcggcta ccgcaggtgc cgcgcgagcg 8460 

1834 gcgtactgac aactagctgt ggtaacaccc tcacttgcta catcaaggcc cgggcagcct 8520 

1835 gtcgagccgc agggctccag gactgcacca tgctcgtgtg tggcgacgac ttagtcgtta 8580 

1836 tctgtgaaag tgcgggggtc caggaggacg cggcgagcct gagagccttc acggaggcta 8640 

1837 tgaccaggta ctccgccccc cccggggacc ccccacaacc agaatacgac ttggagctta 8700 
183 8 taacatcatg ctcctccaac gtgtcagtcg cccacgacgg cgctggaaag agggtctact 8760 
183 9 accttacccg tgaccctaca acccccctcg cgagagccgc gtgggagaca gcaagacaca 8820 

1840 ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc ccccacactg tgggcgagga 8880 

1841 tgatactgat gacccatttc tttagcgtcc tcatagccag ggatcagctt gaacaggctc 8940 

1842 ttaactgtga gatctacgga gcctgctact ccatagaacc actggatcta cctccaatca 9000 

1843 ttcaaagact ccatggcctc agcgcatttt cactccacag ttactctcca ggtgaaatca 9060 

1844 atagggtggc cgcatgcctc agaaaacttg gggtcccgcc cttgcgagct tggagacacc 9120 

1845 gggcccggag cgtccgcgct aggcttctgt ccagaggagg cagggctgct atatgtggca 9180 

1846 agtacctctt caactgggca gtaagaacaa agctcaaact cactccaata gcggccgctg 9240 

1847 gccggctgga cttgtccggt tggttcacgg ctggctacag cgggggagac atttatcaca 93 00 

1848 gcgtgtctca tgcccggccc cgctggttct ggttttgcct actcctgctc gctgcagggg 9360 1 A Jf 



1849 taggcatcta cctcctcccc aaccgatgaa ggttggggta aacactccgg cctcttaagc 9420 

1850 catttcctgt tttttttttt tttttttttt tttttttctt tttttttttc tttcctttcc 9480 

1851 ttcttttttt cctttctttt tcccttcttt aatggtggct ccatcttagc cctagtcacg 9540 

1852 gctagctgtg aaaggtccgt gagccgcatg actgcagaga gtgctgatac tggcctctct 9600 
E--> 1853 gcagatcatg t 9611<210> 

1854 <211> LENGTH: 3015 

1855 <212> TYPE: PRT 

1856 <213> ORGANISM: Hepatitis C virus 



E--> 1858 


<400> SEQUENCE: 


8 
























1859 


Met 


Ser 


Thr 


Asn 


Pro 


Lys 


Pro 


Gin 


Arg 


Lys 


Thr 


Lys 


Arg 


Asn 


Thr 


Asn 


1860 


1 








5 










10 










15 




1861 


Arg 


Arg 


Pro 


Gin 


Asp 


Val 


Lys 


Phe 


Pro 


Gly 


Gly 


Gly 


Gin 


He 


Val 


Gly 


1862 








20 










25 










30 






1863 


Gly 


Val 


Tyr 


Leu 


Leu 


Pro 


Arg 


Arg 


Gly 


Pro 


Arg 


Leu 


Gly 


Val 


Arg 


Ala 


1864 






35 










40 










45 








1865 


Thr 


Arg 


Lys 


Thr 


Ser 


Glu 


Arg 


Ser 


Gin 


Pro 


Arg 


Gly 


Arg 


Arg 


Gin 


Pro 


1866 




50 










55 










60 










1867 


He 


Pro 


Lys 


Asp 


Arg 


Arg 


Ser 


Thr 


Gly 


Lys 


Ser 


Trp 


Gly 


Lys 


Pro 


Gly 


1868 


65 










70 










75 










80 


1869 


Tyr 


Pro 


Trp 


Pro 


Leu 


Tyr 


Gly 


Asn 


Glu 


Gly 


Leu 


Gly 


Trp 


Ala 


Gly 


Trp 


1870 










85 










90 










95 




1871 


Leu 


Leu 


Ser 


Pro 


Arg 


Gly 


Ser 


Arg 


Pro 


Ser 


Trp 


Gly 


Pro 


Asn 


Asp 


Pro 


1872 








100 










105 










110 






1873 


Arg 


His 


Arg 


Ser 


Arg 


Asn 


Val 


Gly 


Lys 


Val 


He 


Asp 


Thr 


Leu 


Thr 


Cys 


1874 






115 
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125 
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E- 



E-- 
E-- 



2368 ccttggccga gcttgccacc aaaagttttg gcagctcctc 

2369 acaatacgac aacatcctct gagcccgccc cttctggctg 

2370 agtcctattc ttccatgccc cccctggagg gggagcctgg 
23 71 ggtcatggtc gacggtcagt agtggggccg acacggaaga 

2372 cttattcctg gacaggcgca ctcgtcaccc cgtgcgctgc 

2373 tcaacgcact gagcaactcg ttgctacgcc atcacaatct 

2374 gcagtgcttg ccaaaggcag aagaaagtca catttgacag 

2375 attaccagga cgtgctcaag gaggtcaaag cagcggcgtc 

2376 tatccgtaga ggaagcttgc agcctgacgc ccccacattc 
2 3 77 atggggcaaa agacgtccgt tgccatgcca gaaaggccgt 

2378 ggaaagacct tctggaagac agtgtaacac caatagacac 

2379 aggttttctg cgttcagcct gagaaggggg gtcgtaagcc 

2380 ccgacctggg cgtgcgcgtg tgcgagaaga tggccctgta 

2381 ccctggccgt gatgggaagc tcctacggat tccaatactc 
23 82 tcctcgtgca agcgtggaag tccaagaaga ccccgatggg 

2383 gttttgactc cacagtcact gagagcgaca tccgtacgga 

2384 gtgacctgga cccccaagcc cgcgtggcca tcaagtccct 

2385 ggggccctct taccaattca aggggggaaa actgcggcta 

2386 gcgtactgac aactagctgt ggtaacaccc tcacttgcta 

2387 gtcgagccgc agggctccag gactgcacca tgctcgtgtg 
23 88 tctgtgaaag tgcgggggtc caggaggacg cggcgagcct 

2389 tgaccaggta ctccgccccc cccggggacc ccccacaacc 

2390 taacatcatg ctcctccaac gtgtcagtcg cccacgacgg 

2391 accttacccg tgaccctaca acccccctcg cgagagccgc 

2392 ctccagtcaa ttcctggcta ggcaacataa tcatgtttgc 

2393 tgatactgat gacccatttc tttagcgtcc tcatagccag 

2394 ttaactgtga gatctacgga gcctgctact ccatagaacc 

2395 ttcaaagact ccatggcctc agcgcatttt cactccacag 

2396 atagggtggc cgcatgcctc agaaaacttg gggtcccgcc 

2397 gggcccggag cgtccgcgct aggcttctgt ccagaggagg 
23 98 agtacctctt caactgggca gtaagaacaa agctcaaact 

2399 gccggctgga cttgtccggt tggttcacgg ctggctacag 

2400 gcgtgtctca tgcccggccc cgctggttct ggttttgcct 

2401 taggcatcta cctcctcccc aaccgatgaa ggttggggta 

2402 catttcctgt tttttttttt tttttttttt tttttttctt 

2403 ttcttttttt cctttctttt tcccttcttt aatggtggct 

2404 gctagctgtg aaaggtccgt gagccgcatg actgcagaga 

2405 gcagatcatg t 

2406 <211> LENGTH: 3015 

2407 <212> TYPE: PRT 

2408 <213> ORGANISM: Hepatitis C virus 
2410 <210> SEQ ID NO: 

2410 <400> SEQUENCE: 10 

2411 Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr 

2412 15 10 

2413 Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly 

2414 20 25 

2415 Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg 

2416 35 40 



aacttccggc 
cccccccgac 
ggatccggat 
tgtcgtgtgc 
ggaagaacaa 
ggtgtattcc 
actgcaagtt 
aaaagtgaag 
agccaaatcc 
agcccacatc 
taccatcatg 
agctcgtctc 
cgacgtggtt 
accaggacag 
gttctcgtat 
ggaggcaatt 
cactgagagg 
ccgcaggtgc 
catcaaggcc 
tggcgacgac 
gagagccttc 
agaatacgac 
cgctggaaag 
gtgggagaca 
ccccacactg 
ggatcagctt 
actggatcta 
ttactctcca 
cttgcgagct 
cagggctgct 
cactccaata 

cgggggagac 

actcctgctc 
aacactccgg 
tttttttttc 
ccatcttagc 
gtgctgatac 



attacgggcg 
tccgacgttg 
ctcagcgacg 
tgctcaatgt 
aaactgccca 
accacttcac 
ctggacagcc 
gctaacttgc 
aagtttggct 
aactccgtgt 
gccaagaacg 
atcgtgttcc 
agcaagctcc 
cgggttgaat 
gatacccgct 
taccaatgtt 
ctttatgttg 
cgcgcgagcg 
cgggcagcct 
ttagtcgtta 
acggaggcta 
ttggagctta 
agggtctact 
gcaagacaca 
tgggcgagga 
gaacaggctc 
cctccaatca 
ggtgaaatca 
tggagacacc 
atatgtggca 
gcggccgctg 
atttatcaca 
gctgcagggg 
cctcttaagc 
tttcctttcc 
cctagtcacg 
tggcctctct 



7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
954( 
9600 1 
9611<210> 



/ 



10 



Lys Arg Asn Thr Asn 
15 

Gly Gin lie Val Gly 
30 

Leu Gly Val Arg Ala 
45 
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3632 gggtgcgcct acacaggttt gcgccccctt gcaagccctt gctgcgggag gaggtatcat 6780 

3633 tcagagtagg actccacgag tacccggtgg ggtcgcaatt accttgcgag cccgaaccgg 6840 

3634 acgtagccgt gttgacgtcc atgctcactg atccctccca tataacagca gaggcggccg 6900 

3635 ggagaaggtt ggcgagaggg tcaccccctt ctatggccag ctcctcggct agccagctgt 6960 

3636 ccgctccatc tctcaaggca acttgcaccg ccaaccatga ctcccctgac gccgagctca 7020 

3637 tagaggctaa cctcctgtgg aggcaggaga tgggcggcaa catcaccagg gttgagtcag 7080 

3638 agaacaaagt ggtgattctg gactccttcg atccgcttgt ggcagaggag gatgagcggg 7140 
363 9 aggtctccgt acctgcagaa attctgcgga agtctcggag attcgcccgg gccctgcccg 72 00 

3640 tctgggcgcg gccggactac aaccccccgc tagtagagac gtggaaaaag cctgactacg 72 60 

3641 aaccacctgt ggtccatggc tgcccgctac cacctccacg gtcccctcct gtgcctccgc 7320 

3642 ctcggaaaaa gcgtacggtg gtcctcaccg aatcaaccct atctactgcc ttggccgagc 7380 

3643 ttgccaccaa aagttttggc agctcctcaa cttccggcat tacgggcgac aatacgacaa 7440 

3644 catcctctga gcccgcccct tctggctgcc cccccgactc cgacgttgag tcctattctt 7500 

3645 ccatgccccc cctggagggg gagcctgggg atccggatct cagcgacggg tcatggtcga 7560 

3646 cggtcagtag tggggccgac acggaagatg tcgtgtgctg ctcaatgtct tattcctgga 762 0 

3647 caggcgcact cgtcaccccg tgcgctgcgg aagaacaaaa actgcccatc aacgcactga 7680 

3648 gcaactcgtt gctacgccat cacaatctgg tgtattccac cacttcacgc agtgcttgcc 7740 

3649 aaaggcagaa gaaagtcaca tttgacagac tgcaagttct ggacagccat taccaggacg 7800 

3650 tgctcaagga ggtcaaagca gcggcgtcaa aagtgaaggc taacttgcta tccgtagagg 7860 

3651 aagcttgcag cctgacgccc ccacattcag ccaaatccaa gtttggctat ggggcaaaag 7920 

3652 acgtccgttg ccatgccaga aaggccgtag cccacatcaa ctccgtgtgg aaagaccttc 7980 

3653 tggaagacag tgtaacacca atagacacta ccatcatggc caagaacgag gttttctgcg 8040 

3654 ttcagcctga gaaggggggt cgtaagccag ctcgtctcat cgtgttcccc gacctgggcg 8100 

3655 tgcgcgtgtg cgagaagatg gccctgtacg acgtggttag caagctcccc ctggccgtga 8160 

3656 tgggaagctc ctacggattc caatactcac caggacagcg ggttgaattc ctcgtgcaag 8220 

3657 cgtggaagtc caagaagacc ccgatggggt tctcgtatga tacccgctgt tttgactcca 82 80 

3658 cagtcactga gagcgacatc cgtacggagg aggcaattta ccaatgttgt gacctggacc 8340 

3659 cccaagcccg cgtggccatc aagtccctca ctgagaggct ttatgttggg ggccctctta 8400 

3660 ccaattcaag gggggaaaac tgcggctacc gcaggtgccg cgcgagcggc gtactgacaa 8460 

3661 ctagctgtgg taacaccctc acttgctaca tcaaggcccg ggcagcctgt cgagccgcag 8520 

3662 ggctccagga ctgcaccatg ctcgtgtgtg gcgacgactt agtcgttatc tgtgaaagtg 8580 

3663 cgggggtcca ggaggacgcg gcgagcctga gagccttcac ggaggctatg accaggtact 8640 

3664 ccgccccccc cggggacccc ccacaaccag aatacgactt ggagcttata acatcatgct 8700 

3665 cctccaacgt gtcagtcgcc cacgacggcg ctggaaagag ggtctactac cttacccgtg 8760 

3666 accctacaac ccccctcgcg agagccgcgt gggagacagc aagacacact ccagtcaatt 882 0 

3667 cctggctagg caacataatc atgtttgccc ccacactgtg ggcgaggatg atactgatga 8880 

3668 cccatttctt tagcgtcctc atagccaggg atcagcttga acaggctctt aactgtgaga 8940 

3669 tctacggagc ctgctactcc atagaaccac tggatctacc tccaatcatt caaagactcc 9000 

3670 atggcctcag cgcattttca ctccacagtt actctccagg tgaaatcaat agggtggccg 9060 

3671 catgcctcag aaaacttggg gtcccgccct tgcgagcttg gagacaccgg gcccggagcg 9120 

3672 tccgcgctag gcttctgtcc agaggaggca gggctgccat atgtggcaag tacctcttca 9180 

3673 actgggcagt aagaacaaag ctcaaactca ctccaatagc ggccgctggc cggctggact 9240 

3674 tgtccggttg gttcacggct ggctacagcg ggggagacat ttatcacagc gtgtctcatg 9300 

3675 cccggccccg ctggttctgg ttttgcctac tcctgctcgc tgcaggggta ggcatctacc 9360 

3676 tcctccccaa ccgatgaagg ttggggtaaa cactccggcc tcttaagcca tttcctgttt 942( 

3677 tttttttttt tttttttttt tttttctttt tttttttctt tcctttcctt ctttttttcc 9480 

3678 tttctttttc ccttctttaa tggtggctcc atcttagccc tagtcacggc tagctgtgaa 954( 
E--> 3679 aggtccgtga gccgcatgac tgcagagagt gctgatactg gcctctctgc agatcatgt 9599<210> 68 

3680 <211> LENGTH: 3011 
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10 & 
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4223 cgcgctaagc tactgtccca gggggggagg gccgccactt gtggcagata cctctttaac 9180 

4224 tgggcagtaa ggaccaagct taaactcact ccaatcccgg ccgcgtccca gctggacttg 9240 

4225 tctggctggt tcgtcgctgg ttacagcggg ggagacatat atcacagcct gtctcgtgcc 9300 

4226 cgaccccgct ggtttccgtt gtgcctactc ctactttctg taggggtagg catttacctg 9360 

4227 ctccccaacc gatgaacggg gagctaacca ctccaggcct taagccattt cctgtttttt 9420 

4228 tttttttttt tttttttttt ttcttttttt ttttctttcc tttccttctt tttttccttt 948C 

4229 ctttttccct tctttaatgg tggctccatc ttagccctag tcacggctag ctgtgaaagg 954C 
E--> 4230 tccgtgagcc gcatgactgc agagagtgct gatactggcc tctctgcaga tcatgt 9596<210> 70 

4231 <211> LENGTH: 3010 

4232 <212> TYPE: PRT 

4233 <213> ORGANISM: Hepatitis C virus 
E--> 4235 <210> SEQ ID NO: 

E--> 4235 <400> SEQUENCE: 70 

4236 Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 

4237 15 10 15 

4238 Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 

4239 20 25 30 

4240 Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 

4241 35 40 45 

4242 Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 

4243 50 55 60 

4244 He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 

4245 65 70 75 80 

4246 Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 

4247 85 90 95 

4248 Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 

4249 100 105 110 

4250 Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 

4251 115 120 125 

4252 Gly Phe Ala Asp Leu Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu 

4253 130 135 140 

4254 Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 

4255 145 150 155 160 

4256 Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 

4257 165 170 175 

4258 Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr 

4259 180 185 190 

4260 Glu Val Arg Asn Val Ser Gly He Tyr His Val Thr Asn Asp Cys Ser 

4261 195 200 205 

4262 Asn Ser Ser He Val Tyr Glu Ala Ala Asp Val He Met His Thr Pro 

4263 210 215 220 

4264 Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ser Ser Arg Cys Trp Val 

4265 225 230 235 240 

4266 Ala Leu Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 

4267 245 250 255 

4268 Thr He Arg Arg His Val Asp Leu Leu Val Gly Thr Ala Ala Phe Cys 

4269 260 265 270 

4270 Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser He Phe Leu Val Ser 

4271 275 280 285 
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